Constrained Policy Optimization Via Bayesian World Models